Asynchronous Execution of the Fast Multipole Method Using Charm++

Authors

  • Mustafa Abdul Jabbar
  • Rio Yokota
  • David Keyes
Abstract

Fast multipole methods (FMM) on distributed memory have traditionally used a bulk-synchronous model of communicating the local essential tree (LET) and overlapping it with computation of the local data. This could be perceived as an extreme case of data aggregation, where the whole LET is communicated at once. Charm++ allows much finer control over the granularity of communication, and has an asynchronous execution model that fits well with the structure of our FMM code. Unlike previous work on asynchronous fast N-body methods such as ChaNGa and PEPC, the present work directly compares the traditional bulk-synchronous approach with the asynchronous approach using Charm++. Furthermore, the serial performance of our FMM code is over an order of magnitude better than that of these previous codes, so it is much more challenging to hide the overhead of Charm++.

Similar resources

A Comparative Study of Multipole and Empirical Relations Methods for Effective Index and Dispersion Calculations of Silica-Based Photonic Crystal Fibers

In this paper, we present a solid-core silica-based photonic crystal fiber (PCF) composed of a hexagonal lattice of air holes and calculate the effective index and chromatic dispersion of the PCF for different physical parameters using the empirical relations method (ERM). These results are compared with the data obtained from the conventional multipole method (MPM). Our simulation results reveal tha...

Data-Driven Execution of Fast Multipole Methods

Fast multipole methods have O(N) complexity, are compute bound, and require very little synchronization, which makes them a favorable algorithm on next-generation supercomputers. Their most common application is to accelerate N-body problems, but they can also be used to solve boundary integral equations. When the particle distribution is irregular and the tree structure is adaptive, load-bala...

A New Technique for the Calculation of Colliding Vortex Rings

The present study involves a novel computational technique based on the simultaneous use of the pseudo-particle method, the Poisson integral method, and a special-purpose computer originally designed for molecular dynamics simulations (MDGRAPE-3). In the present calculations, the dynamics of two colliding vortex rings have been studied using the vortex method. The present acceleration technique allows...

Numerical Tests for the Recovery of the Gravity Field by Fast Boundary Element Methods

The purpose of this paper is to test the applicability of a fast boundary element method in the context of geoid computations of the gravity field. The fast multipole method is the method of choice due to its advantageous property of fast evaluation in the post-processing. Several sets of triangular meshes for the approximation of the unit sphere and several modifications of the prescribed dat...

RPYFMM: Parallel Adaptive Fast Multipole Method for Rotne-Prager-Yamakawa Tensor in Biomolecular Hydrodynamics Simulations

RPYFMM is a software package for the efficient evaluation of the potential field governed by the Rotne-Prager-Yamakawa (RPY) tensor interactions in biomolecular hydrodynamics simulations. In our algorithm, the RPY tensor is decomposed as a linear combination of four Laplace interactions, each of which is evaluated using the adaptive fast multipole method (FMM) [1] where the exponential expansio...

Journal title:
  • CoRR

Volume abs/1405.7487

Publication date 2014